Enhancement automatic speech recognition by deep neural networks
Abstract
The performance of speech recognition systems based on deep learning has improved dramatically in recent years through different designs and methodologies. A popular way to increase the amount of training data is Data Augmentation (DA); research shows that DA is effective at teaching neural network models to make invariant predictions. Furthermore, EM approaches have attracted machine-learning researchers' attention as a means of improving classifier performance. This study presents a unique approach that employs both to improve the system's prediction accuracy. First, an existing approach based on vocal tract length perturbation is reviewed, and feature perturbation is then proposed as an alternative approach for amending the training sets. This is followed by the integration of posterior probabilities obtained from several DNN acoustic models trained on diverse datasets. The study's findings show that the proposed approach improves recognition performance.
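The integration of posterior probabilities mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state how the posteriors are combined, so the weighted arithmetic mean used here is an assumption, and the function name `combine_posteriors` and the toy values are hypothetical.

```python
def combine_posteriors(posteriors, weights=None):
    """Weighted average of per-frame class posteriors from several models.

    posteriors: list of models, each a list of frames, each frame a list of
                class probabilities that sums to 1.
    weights:    optional per-model weights; uniform when omitted (assumption).
    """
    num_models = len(posteriors)
    if weights is None:
        weights = [1.0] * num_models
    total = sum(weights)
    weights = [w / total for w in weights]  # normalise so weights sum to 1

    num_frames = len(posteriors[0])
    num_classes = len(posteriors[0][0])
    combined = []
    for t in range(num_frames):
        frame = [
            sum(weights[m] * posteriors[m][t][c] for m in range(num_models))
            for c in range(num_classes)
        ]
        combined.append(frame)
    return combined


# Toy posteriors from two hypothetical acoustic models: 2 frames, 3 classes.
model_a = [[0.7, 0.2, 0.1], [0.1, 0.6, 0.3]]
model_b = [[0.5, 0.4, 0.1], [0.2, 0.2, 0.6]]

combined = combine_posteriors([model_a, model_b])
# Most likely class per frame after combination.
labels = [frame.index(max(frame)) for frame in combined]
```

Because each input frame sums to 1 and the weights are normalised, each combined frame is still a valid probability distribution. Other combination rules (for example a geometric mean) are equally plausible readings of the abstract.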
Similar resources
Automatic Speech Recognition with Deep Neural Networks for Impaired Speech
Automatic Speech Recognition has reached almost human performance in some controlled scenarios. However, recognition of impaired speech is a difficult task for two main reasons: data is (i) scarce and (ii) heterogeneous. In this work we train different architectures on a database of dysarthric speech. A comparison between architectures shows that, even with a small database, hybrid DNN-HMM mode...
An Automatic Dysarthric Speech Recognition Approach using Deep Neural Networks
Transcribing dysarthric speech into text is still a challenging problem for state-of-the-art techniques and commercially available speech recognition systems. To improve the accuracy of dysarthric speech recognition, this paper adopts Deep Belief Neural Networks (DBNs) to model the distribution of the dysarthric speech signal. A continuous dysarthric speech recognition system is produced, in whic...
Binary Deep Neural Networks for Speech Recognition
Deep neural networks (DNNs) are widely used in most current automatic speech recognition (ASR) systems. To guarantee good recognition performance, DNNs usually require significant computational resources, which limits their application to low-power devices. Thus, it is appealing to reduce the computational cost while keeping the accuracy. In this work, in light of the success in image recogniti...
Deep segmental neural networks for speech recognition
Hybrid systems which integrate the deep neural network (DNN) and hidden Markov model (HMM) have recently achieved remarkable performance in many large vocabulary speech recognition tasks. These systems, however, still rely on the HMM and assume the acoustic scores for the (windowed) frames are independent given the state, suffering from the same difficulty as in the previous GMM-HMM systems...
Journal
Journal title: Periodicals of Engineering and Natural Sciences (PEN)
Year: 2021
ISSN: 2303-4521
DOI: https://doi.org/10.21533/pen.v9i4.2450